Fixation and consensus times on a network: a unified approach 
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We investigate a set of stochastic models of biodiversity, population genetics, language evolution 
and opinion dynamics on a network within a common framework. Each node has a state, < Xi < 1, 
with interactions specified by strengths mij. For any set of niij we derive an approximate expression 
for the mean time to reach fixation or consensus (all Xi — or 1) . Remarkably in a case relevant to 
language change this time is independent of the network structure. 
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Mathematical models predicting biological and social 
change are becoming increasingly commonplace, with the 
last few years having seen an explosion of activity among 
statistical physicists in cultural dynamics (Ij] . One aspect 
of this work which is not widely appreciated is that sev- 
eral seemingly distinct phenomena can be described by 
very similar models: in some cases they can even be ex- 
actly mapped into each other 0]. Examples include bio- 
diversity jj], population genetics [1], opinion dynamics 
[H and language change [5j]. The common thread is that 
objects that come in different variants are copied from 
one place (an "island") to another according to some 
stochastic rule. If no new variants are created in the 
process (e.g., by mutation) and the number of objects 
does not grow without bound, one variant is eventually 
guaranteed to take over an entire population, or go to fix- 
ation in the genetics parlance. Changes in the network 
structure connecting different islands, and the stochastic 
rules used to choose the source and target islands, lead 
to a variety of scaling laws relating the number of islands 
and time to reach fixation (see, e.g., this Let- 

ter, we present a theoretical treatment of a very general 
stochastic copying model that includes many previously- 
studied cases and unifies the diverse fixation time results 
obtained so far. We also discuss a mapping to a particle 
reaction system from which it can be shown that our pre- 
diction for the fixation time, obtained by making various 
approximations, can in many cases be stated as a bound 
that simulations show is often saturated. We also discuss 
a key application of our findings — to a current theory of 
new-dialect formation 

To establish the basic features of the large class of mod- 
els we consider, we describe a prominent special case, 
Hubbell's model of biodiversity and biogeography 
Here there are only two islands: a metacommunity or 
mainland (island 1) and a local community (island 2). 
The objects are individuals which compete for a common 
resource (e.g. trees competing for space, sunlight and nu- 
trients [lO|) and the variants are different species. At 
regular time intervals an individual in island 2 is picked 



FIG. 1: Stochastic copying dynamics between two islands i 
and j forming part of a larger network. After a death on i, an 
object is replaced either with a copy from (a) the same island 
i or (b) a different island j (rate mij). 



at random to die and to be replaced by a copy of either 

(a) another individual picked at random from island 2, 
or (b) a migrant from island 1. Process (b) is assumed 
to happen less frequently than (a), and as mentioned 
above, if (b) is absent then the final state of the system 
is one which contains individuals of only one species. The 
number of individuals on island 2 is a constant, n, and 
new species are created on island 1, but not island 2, by 
mutation-like events. This model is a neutral theory; no 
one species is assumed to be "fitter" than another. 

The relationship between this model and simple neu- 
tral models of population genetics is well known [sj : in- 
dividuals are analogous to genes and species are types 
of genes (alleles). A more general model comprises a set 
of islands labeled i — 1, . . . ,N each of which contains n 
genes (in reality, n individuals each containing one copy 
of the gene of interest). For simplicity we assume that 
there is no mutation, so that no new alleles may be cre- 
ated. The alleles are labeled by a = 1,...,M. The 
dynamical processes, illustrated in Fig. [1] are as before: 
a death on island i followed by (a) a birth on island i, or 

(b) a migrant offspring from island j arriving on island i. 
There are various ways to parametrize these dynamics. 
As in [il, we let the parent (copied object) be taken from 
island j with probability fj. In process (b) the proba- 
bility the offspring (copy) lands on island i is taken to 
be proportional to my , which specifies a migration rate 



2 



within a standard continuous-time limit described in 
but whose details are unimportant here. At any time t, 
the state of the system can be given in terms of the frac- 
tion Xia{t) of genes on island i that are allele a. Then, 
the islands can be represented as nodes on a network, 
which in the opinion dynamics and language change mod- 
els are individuals i with opinions or language variants a 
expressed with frequency Xia- In these social contexts, 
fixation is sometimes called consensus. 

Using the formalism first developed for these systems 
in population genetics we describe the evolution in 
terms of a Fokker-Planck equation. Suppose, first of all, 
that there is only one island and no mutation. Then the 
only dynamical process is random genetic drift in which 
the frequencies Xa{t) diffuse on the interval [0, 1] due to 
the random sampling in the death/birth process. For 
simplicity, we will assume that there are only two alleles 
which have frequencies x and (1 — x), so the state of 
the system is described by the single stochastic variable 
X G [0, 1]. The probability that the system is in the state 
X at time t, P{x,t), satisfies the Fokker-Planck equation 
dtP{x,t) = dl[D{x)P{x,t)] where the diffusion constant 
is state dependent: D{x) = x{l — x)/2 [4]. Moving to 
the case of iV islands, with migration rate TOy from j to 
i, the Fokker-Planck equation for P{xi,t) now reads Q 
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where {ij) means sum over distinct pairs i,j. This may 
be generalized to M > 2 and to include mutation [2] , but 
Eq. Il]) will be sufficient for our purposes. 

Analysis of Eq. ([T]) is, on the face of it, a hopeless 
task since it has many degrees of freedom, Xi, interact- 
ing with arbitrary strengths ruij. However, much of the 
macroscopic dynamics is captured by the first and sec- 
ond moments of Xi{t). The mean, ai{t) = {xi{t)) can be 
found from Eq. ([1]) to evolve according to 
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where the equivalence holds if the diagonal elements 
ma are defined to be —'^j^i'm-ij- This matrix has a 
zero eigenvalue, which we will assume is non-degenerate. 
The associated right eigenvector has all elements equal 
to one, and the left eigenvector we denote Qi, so that 
^j^QiTnij = 0, and normalized such that ^j^Qi — 1. 
Then we find from Eq. ^ that the ensemble (noise- 
history) average of the collective variable 
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FIG. 2: Numerical solution for means a^, Uj and (co) variances 
aij = Pij — aiUj on a fully-connected network of A'' = 20 sites. 
Note the as converge at a time To much less than the mean 
fixation time T, and at time To, aij ~ for i j. These 
features become more pronounced as is increased. 



is conserved by the dynamics. Decomposing ai{t) in 
terms of its right eigenvectors we see that it approaches 
a constant independent of i as t oo. Since all Xi(t) 
tend to or 1 as i ^ oo, this is the probability of the 
allele fixing. From we see {£,{t)) also approaches this 
value in this limit and since ^ is conserved, the fixation 
probability is ^(0) ^E^. 

Numerical studies suggest that convergence of the en- 
semble average ai{t) to its asymptote ^(0) occurs on a 
much shorter timescale than the ultimate fixation of a 
variant, which in turn governs the rate of change of the 
second moments (3ij{t) = {xi{t)xj{t)), see Fig. [21 Eq. ([1]) 
implies for the latter 
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whilst the mean time to fixation, T, is given by the solu- 
tion of a backward version of the Fokker-Planck equation 
© 
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The assumption that the time over which all the ai con- 
verge. To, is much less than T (see Fig. [2]) leads to the 
following approximate treatment of this equation. We as- 
sume that T depends only on the state of the system at 
time To, and principally through ^(0). Changing to the 
^(0) variable using Eq. ([3]) we find that the first term on 
the right-hand side of Eq. ([5|) vanishes Q giving 
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2 = ^/^Q-x.(To)[l-x,(To)] 
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This equation still depends on the variables Xi at time 
To. We can estimate these by assuming that correlations 
between the nodes are absent, i.e. Pij = aiOtj = ^(0)^ i ^ 
j, and that the rate of change of the variance of xi is 
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sufficiently slow that the time derivative in Eq. ([4]) when 
i = j can be neglected (see again Fig. 2). Then Pa at 
time Tq can be estimated from Eq. 

By replacing Xi{l — Xi) in Eq. ([5]) by ai — Pa at time 
To, wc find [e(0)(l - ^(0))] (pT/di{Qf = -2/r where 
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The mean fixation time is obtained by integrating the 
equation for r(^(0)), with the boundary conditions 
T(0) = T(l) = to give 

mm = K(o) ine(o) + (1 - m) mi - am , 

r 

(8) 

which in tandem with ^ is our main result, derived for a 
large class of stochastic-copying processes on any network 
and arbitrary migration rates. It should be emphasized 
that the reduction from a system with (M — 1) degrees 
of freedom, to a system with only one degree of freedom 
is not imposed, but instead emerges from the dynamics. 

For orientation and as a check of these results, we ver- 
ify that specific choices of the parameters fi and rrnj give 
expressions which have been previously obtained. As a 
first check, we examine the case of voter dynamics on het- 
erogeneous graphs described in @, 0] ■ The voter model 
proper has each node chosen uniformly to be the recipi- 
ent of a new opinion, hence all fi = 1/N. This opinion 
is selected randomly from the node's neighbors, giving 
rriij = Aij/{2Nki), where Aij = 1 if nodes i,j are con- 
nected, ki is the degree of node i, ki = J2j=ii ^ijj ^^'^ the 
factor of 2N is a consequence of the continuous-time limit 
described in To calculate r, one needs Qi, the nor- 
malized zero left eigenvector of the matrix . One finds 
that this eigenvector equals Qi = ki/{Nk), where k is the 
mean degree and is included to normalize the Qi. Putting 
all this together we find from ^ that r = 'k^/2{Nk)'^. 
This corresponds with the results of @, [3| after taking 
into account the different choice of time units made in 
those works. 

An invasion process, where the source node of an opin- 
ion is randomly selected in each timestep, was also con- 
sidered in 0. This corresponds to the choice = 
Aij/{2Nkj), from which Qi may again be found. How- 
ever, the resulting expression for r via ([7]) does not sim- 
plify unless one additionally assumes (as in [1, 0) that 
node degrees are uncorrelated, Aij = kikj/{Nk). Taking 
/,; = as before, one finds from ^ the result of [7| 
in the limit TV — > oo. Our treatment however also ex- 
tends to networks with strong degree correlations, e.g., 
a star network that has a central node connected to all 
iV — 1 outer nodes, which in turn are connected only to 
the central node. For this network, ([Tj) predicts a fixa- 
tion time proportional to N^, confirmed by simulation 
data (not shown). By contrast, the approximation of [7] 



predicts a fixation time that, in our time units, increases 
quadratically with A'^. 

We also recover known results for link dynamics 0, [3] , 
where an opinion is copied in a randomly-chosen direction 
along a randomly-chosen link in each timestep. Here, we 
have fi = ki/N and = Aij/(2N) which implies that 
Qi — 1/N. In this special case it has been noted that the 
mean time to fixation does not depend on the network 
structure 0, Furthermore, our general result ^ 

includes the various scaling forms found in 

The generality of our results can be tested more strin- 
gently by choosing fi and rriij from various random dis- 
tributions in such a way that the numerators and de- 
nominators in ([7]) are of similar magnitudes. Simulation 
results (not shown) on Erdos-Renyi random graphs of 
varying densities are consistent with Eqs. ([7]) and 
([S]), except when T and Tq turn out to be of a similar 
order in N . 

A concrete application of the model is to an evolution- 
ary model of language change 0] ■ In fact it was our ex- 
perience in applying this model to the emergence of New 
Zealand English p^] that alerted us to the possibility of 
a general analysis and to some surprising behavior for a 
subclass of models. As indicated earlier, in this particular 
application the index i labels speakers who converse and 
so affect each others grammar. The probability that two 
speakers interact is given by a matrix Gij ; this is a reflec- 
tion of the topology of the network. Another factor Hij 
accounts for the weight that i gives to the utterances of j. 
It is the product of the dj and Hij that is equal to . 
While Gij is by definition symmetric, Hij need not be. In 
the application to the formation of New Zealand English, 
Trudgill [§] has postulated that the large quantity of data 
that exists on the emergence on this English variant 
may be explained by assuming that social factors which 
are modeled by Hij are unimportant, and therefore this 
factor may be replaced by a constant. If this is so, the 
rriij is symmetric; if the model is not socially-neutral, 
then m.y will not in general be symmetric. 

To test Trudgill's theory, we assume rriij is symmetric 
and using the results ([7]) and ([8]) check if the mean time 
to fixation is in accord with the data. A priori one would 
expect that the precise nature of the speaker network 
during the formation of New Zealand English would be 
required. However, remarkably, it turns out that in this 
case the time to fixation is completely independent of the 
network structure! This is because the symmetry of my 
implies that its left and right eigenvectors are propor- 
tional to one another, and so Qi = This together 
with J2j '"^ij /i implies a constant r from Eq. (O, 
which is confirmed by simulation results shown in Fig. 3 
(although see below for some caveats). To evaluate the 
plausibility of this socially-neutral model as a mechanism 
for new dialect formation requires in addition to this re- 
sult detailed consideration of human memory lifetimes 
that are beyond the scope of this article and discussed 
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FIG. 3: Fixation times per speaker within the socially-neutral 
utterance selection model (defined in the text) on various net- 
works with A*' nodes (see legend). 

elsewhere [TB]. Ultimately, we conclude that a purely 
neutral model is hard to reconcile with available empiri- 
cal data. 

We remark that our main result, Eq. ([7]), can also be 
obtained and interpreted within a backwards-time for- 
mulation. Consider for example two objects of the same 
type; by asking which objects these were copied from 
at some earlier time, one can reconstruct their ances- 
tral lineages which will hop between nodes as one goes 
back in time. Eventually, both objects will have been 
copied from the same 'parent', causing merger of the lin- 
eages. This process is called the coalescent in population 



genetics [17[ , but by viewing the lineages as particle tra- 
jectories, one can also recognize it as the A + A ^ A 
particle coalescence reaction of non-equilibrium statisti- 
cal mechanics [3l on a network. 

In this picture, the quantity r in Eq. ([7]) is the asymp- 
totic rate at which the last two unreacted particles co- 
alesce. The specific expression quoted above can be in 
fact obtained as a bound by means of a variational prin- 
ciple. Most usefully, this approach — whose details will 
be presented elsewhere [l^ — yields physical insight into 
when the approximations we have made to obtain ([7]) are 
valid. The key assumption is that the last two unreacted 
particles must each typically have been able to explore a 
large number of the sites of the network before the final 
coalescence takes place. Then, we regard the particles as 
well-mixed: the asymptotic probability of finding a par- 
ticle on site i relative to finding it on site j is assumed 
to be independent of the location of the other particle 
unless both particles are on the same site (since then 
they may react). In practice, we have found the presence 
of sufficient long-range connections in the network gives 
rise to a well-mixed two-particle state at late times in the 
particle-reaction process. On the other hand, a bottle- 
neck that dramatically extends the time taken to reach 
one subset of sites from another would likely lead to this 
assumption of well-mixedness breaking down. Although 
([7]) would then cease to be valid, the variational approach 
of [l^ could, in principle, be refined to take such net- 
work structures into account, a task we leave for future 
work. Finally, we remark that the pivotal assumption 



To <C T corresponds to taking the longest timescale in 
the backward-time dynamics to be that associated with 
the final coalescence reaction; that is, that the subsequent 
relaxation of the one particle state to its equilibrium oc- 
curs on a much shorter timescale. If the last two particles 
are well-mixed, we would expect this assumption to hold, 
since each particle would be close to the single-particle 
equilibrium state. 

In summary, we have provided an approximate theory, 
validated numerically on a range of networks, for cal- 
culating the fixation time within a general model that 
has features shared by physical, biological and social sys- 
tems. Since in reality specific applications will contain 
additional processes, it is of interest to extend the general 
approach we have described to cater for these. We have 
also briefly discussed how a particularly striking result — 
that fixation time is independent of network structure — is 
of direct relevance to current linguistic theory [^, [ist and 
hope that the trend whereby mathematical results for 
formal models of social behavior are applied in empirical 
contexts will continue, and increase, in the future. 

RAB holds an RCUK Academic Fellowship. AJM 
wishes to thank the EPSRC (UK) for financial support 
under grant GR/T11784. 
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